14 research outputs found
Assessing the influence of phonetic variation on the perception of spoken threats
In spite of the belief that there is such a thing as a âthreatening tone of voiceâ (Watt, Kelly and Llamas, 2013), there is currently little research which explores how listeners infer traits such as threat from speakersâ voices. This thesis addresses the question of how listeners infer traits such as how threatening speakers sound, and whether phonetic aspects of speakersâ voices can play a role in shaping these evaluations. Additionally, it is sometimes the case that a victim of a crime will never see the perpetratorâs face but will hear the perpetratorâs voice. In such cases, attempts can be made to get the witness or victim to describe the offenderâs voice. However, one problem with this is whether phonetically untrained listeners have the ability to accurately describe different aspects of speakersâ voices. This issue is also addressed throughout this thesis.
Over five experiments, this thesis investigates the influence of a range of linguistic and phonetic variables on listenersâ evaluations of how threatening speakers sounded when producing indirect threat utterances. It also examines how accurately phonetically-untrained listeners can describe different aspects of speakersâ voices alongside their evaluative judgements of traits such as threat and intent-to-harm. As well as showing that a range of linguistic and phonetic variables can influence listenersâ threat evaluations, results support the view that caution should be adopted in over-reliance on the idea that people will âknow a threat when they hear oneâ (Gingiss, 1986:153). This research begins to address the phonetic basis for the perceptual existence of a âthreatening tone of voiceâ, along with how listeners evaluate and describe voices in earwitness contexts. Suggestions are made at the end of the thesis for improvements in the elicitation and implementation of accurate, meaningful information about speakersâ voices from linguistically-untrained listeners in evaluative settings involving spoken threats
Assessing the abilities of phonetically untrained listeners to determine pitch and speaker accent in unfamiliar voices
It is sometimes the case that a victim of a crime will never see the perpetratorâs face, but will be exposed to his or her voice. This could occur in situations such as masked robberies, telephone fraud, or the receipt of bomb threats via phone or voicemail. In such cases, attempts can be made by the police or intelligence services to get the witness or victim to describe the voice of the offender. However, there is a high likelihood that a given earwitness will lack the linguistic expertise and technical vocabulary of the kind used by trained phoneticians when they describe voices. One question that arises from this problem is whether phonetically untrained listeners have the ability, using verbal means, to accurately capture different aspects of speakersâ voices. This paper presents an experiment in which a group of listeners were tasked with assessing how âhigh-pitchedâ the voices of 12 speakers were, along with providing a description of each speakerâs accent. These assessments were then compared to measured Fundamental Frequency (F0) values and prior knowledge of the speakersâ accents in order to assess listener performance. The results suggest that while some listeners have the ability to make reliable judgements of relative vocal pitch, the overall correlations between measured (F0) and perceived pitch were weak. With regard to accent, the results suggest that the more unfamiliar a speakerâs accent is to the listener, perhaps owing to the geographical distance of the area where the accent is spoken from the listenerâs place of origin, the harder it will be for the listener to accurately describe that accent. We argue that testing the abilities of earwitnesses to assess aspects of speakersâ voices before their descriptions are used further would be a useful safeguard against the use of potentially inaccurate or erroneous earwitness evidence in police investigations
'For the Record': applying linguistics to improve evidential consistency in police investigative interview records
The âFor the Recordâ project (FTR) is a collaboration between a team of linguistic researchers and police in the England & Wales jurisdiction (E&W). The aim of the project is to apply insights from linguistics to improve evidential consistency in police interview transcripts, which are routinely produced by transcribers employed by the police. The research described in this short report is intended as a pilot study, before extension nationally. For this part of the project, we analysed several types of data, including interview audio and transcripts provided by one force. This identified key areas where current transcription practice could be improved and enhanced, and a series of recommendations were made to that force. This pilot study indicates that there are three core components of quality transcription production in this context: Consistency, Accuracy, and Neutrality. We propose that the most effective way to address the issues identified is through developing new training and guidance for police interview transcribers
Perceptual instability in police interview records: Examining the effect of pauses and modality on peopleâs perceptions of an interviewee
This article examines whether the representation of linguistic features within transcripts and audio recordings of police interviews can influence peopleâs perceptions of the interviewee. We specifically examine the influence of the representation of pauses through an experimental methodology. Participants were presented with a police interview either in audio format or in one of a series of transcript formats and asked to make a series of judgements about the interviewee. We manipulated both the presence and representation of pauses within the audio and transcript stimuli to assess how this would influence perceptions. Results showed differences between perceptions of the interviewee in the audio and transcript conditions, and that different representations of pauses within transcripts created perceptual instability between participants. The findings illustrate that the presence and representation of linguistic features in transcripts can affect perceptual judgements. We argue this should be explicitly considered by those using transcripts within the legal system
The role of face and voice cues in predicting the outcome of student representative elections
First impressions formed after seeing someoneâs face or hearing their voice can affect many social decisions, including voting in political elections. Despite the many studies investigating the independent contribution of face and voice cues to electoral success, their integration is still not well understood. Here, we examine a novel electoral context, student representative ballots, allowing us to test the generalisability of previous studies. We also examine the independent contributions of visual, auditory and audiovisual information to social judgements of the candidates, and their relationship to election outcomes. Results showed that perceived trustworthiness was the only trait significantly related to election success. These findings contrast with previous reports on the importance of perceived competence using audio or visual cues only in the context of national political elections. The present study highlights the role of real-world context and emphasises the importance of using ecologically valid stimulus presentation in understanding real-life social judgement
Voice as a design material : sociophonetic inspired design strategies in Human-Computer Interaction
While there is a renewed interest in voice user interfaces (VUI) in HCI, little attention has been paid to the design of VUI voice output beyond intelligibility and naturalness. We draw on the field of sociophonetics - the study of the social factors that influence the production and perception of speech - to highlight how current VUIs are based on a limited and homogenised set of voice outputs. We argue that current systems do not adequately consider the diversity of peoplesâ speech, how that diversity represents sociocultural identities, and how voices have the potential to shape user perceptions and experiences. Ultimately, as other technological developments have influenced the ideologies of language, the voice outputs of VUIs will influence the ideologies of speech. Based on our argument, we pose three design strategies for VUI voice output design - individualisation, context awareness, and diversification - to motivate new ways of conceptualising and designing these technologies
Perception of threat and intent to harm from vocal and facial cues
What constitutes a âthreatening tone of voiceâ? There is currently little research exploring how listeners infer threat, or the intention to cause harm, from speakersâ voices. Here, we investigated the influence of key linguistic variables on these evaluations (Study 1). Results showed a trend for voices perceived to be lower in pitch, particularly those of male speakers, to be evaluated as sounding more threatening and conveying greater intent to harm. We next investigated the evaluation of multimodal stimuli comprising voices and faces varying in perceived dominance (Study 2). Visual information about the speakerâs face had a significant effect on threat and intent ratings. In both experiments, we observed a relatively low level of agreement among individual listenersâ evaluations, emphasising idiosyncrasy in the ways in which threat and intent-to-harm are perceived. This research provides a basis for the perceptual experience of a âthreatening tone of voice,â along with an exploration of vocal and facial cue integration in social evaluation. </jats:p
Understanding the role of transcription in evidential consistency of police interview records in England and Wales
Evidential records of investigative interviews serve an important institutional purpose within the legal system in England and Wales. Academic scholars have long recognized that little institutional attention is paid to the transformation process that occurs when written records of the spoken are produced, nor to the potential impact this has on later interpretation by users of the records during the investigation of crimes and later in court. We analyse 29 digitally recorded investigative interviews and their corresponding official written âRecord of Taped/Videoed Interviewâ (ROTI/ROVI) transcripts, taking an ethnomethodological, conversation analytic (CA) approach to examine the social actions that are transformed in this activity by comparing the audio record of police interview evidence to the written transcripts. The intended outcome of this work, within the wider project of which this forms, is to better understand this process within the legal system, and to incite improvements